Bi-LSTM Model to Recognize Human Activities in UAV Videos using Inflated I3D-ConvNet

نویسندگان

چکیده

Human activity recognition in aerial videos is an emerging research area. In this paper, Inflated I3D-ConvNet (Inflated I3D) and Bidirectional Long Short-Term Memory (Bi-LSTM) based human action model UAV have been proposed. The initial module was pre-trained using the Kinetics-400 video dataset, which consisted of 400 classes activities around clips for each class culled from real-world arduous YouTube videos. proposed inflated built on 2D-ConvNet inflation learns extracts spatio-temporal features while leveraging architectural design Inception-V1. employs Bi-LSTM architecture classification Drone-Action dataset a smaller benchmark UAV-captured dataset. This considerably improves state-of-the-art results SoftMax classifier retains accuracy about 98.4%.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Person re-identification in UAV videos using releavance feedback

In this paper we present an approach to recognize individual persons in aerial video data. We start from a baseline approach that matches a number of color and texture image features in order to find possible matches to a query person track. This approach is improved by incorporating operator feedback in three ways. First, features are being weighted based on their degree of agreement (correlat...

متن کامل

Modeling Scenes and Human Activities in Videos

In this dissertation, we address the problem of understanding human activities in videos by developing a two-pronged approach: coarse level modeling of scene activities and fine level modeling of individual activities. At the coarse level, where the resolution of the video is low, we rely on person tracks. At the fine level, richer features are available to identify different parts of the human...

متن کامل

Analysis of Gait Pattern to Recognize the Human Activities

Human activity recognition based on the computer vision is the process of labelling image sequences with action labels. Accurate systems for this problem are applied in areas such as visual surveillance, human computer interaction and video retrieval. The challenges are due to variations in motion, recording settings and gait differences. Here we propose an approach to recognize the human activ...

متن کامل

Learning to Recognize Human Activities from Soft Labeled Data

An activity recognition system is a very important component for assistant robots, but training such a system usually requires a large and correctly labeled dataset. Most of the previous works only allow training data to have a single activity label per segment, which is overly restrictive because the labels are not always certain. It is, therefore, desirable to allow multiple labels for ambigu...

متن کامل

Incremental Parsing with Minimal Features Using Bi-Directional LSTM

Recently, neural network approaches for parsing have largely automated the combination of individual features, but still rely on (often a larger number of) atomic features created from human linguistic intuition, and potentially omitting important global context. To further reduce feature engineering to the bare minimum, we use bi-directional LSTM sentence representations to model a parser stat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Advanced Computer Science and Applications

سال: 2022

ISSN: ['2158-107X', '2156-5570']

DOI: https://doi.org/10.14569/ijacsa.2022.01312111